Determinization in Monte-Carlo Tree Search for the card game Dou Di Zhu

Authors

  • Edward Powley
  • Daniel Whitehouse
  • Peter Cowling
Abstract

Monte-Carlo Tree Search (MCTS) is a class of game tree search algorithms that have recently proven successful for deterministic games of perfect information, particularly the game of Go. Determinization is an AI technique for making decisions in stochastic games of imperfect information by analysing several instances of the equivalent deterministic game of perfect information. In this paper we combine determinization techniques with MCTS for the popular Chinese card game Dou Di Zhu. In determinized MCTS, there is a trade-off between the number of determinizations searched and the time spent searching each one; however, we show that this trade-off does not significantly affect the performance of determinized MCTS, as long as both quantities are sufficiently large. We also show that the ability to see opponents’ hidden cards in Dou Di Zhu is a significant advantage, which suggests that inference techniques could potentially lead to much stronger play.
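The trade-off described in the abstract (how many determinizations to sample versus how long to search each one) can be illustrated with a minimal sketch. This is not the authors' implementation: it swaps full MCTS for a root-level UCB1 bandit search, and the toy game (play one card from a hand against a single hidden opponent card), the names `ucb1_root_search`, `determinized_search`, `toy_reward`, `HAND`, and `UNSEEN`, and the choice to combine results by summing root visit counts are all assumptions made for illustration.

```python
import math
import random
from collections import Counter


def ucb1_root_search(moves, reward, iterations, c=math.sqrt(2)):
    """Search ONE determinization with UCB1 over the root moves only.

    `reward(move)` scores a move in the perfect-information game obtained
    by fixing the hidden information. Returns visit counts per move.
    """
    visits = Counter()
    total = Counter()
    for n in range(1, iterations + 1):
        untried = [m for m in moves if visits[m] == 0]
        if untried:
            move = untried[0]  # try every move once before using UCB1
        else:
            move = max(
                moves,
                key=lambda m: total[m] / visits[m]
                + c * math.sqrt(math.log(n) / visits[m]),
            )
        total[move] += reward(move)
        visits[move] += 1
    return visits


def determinized_search(moves, sample_hidden, reward, num_dets, iters_per_det):
    """Sample `num_dets` determinizations, search each for `iters_per_det`
    iterations, and pick the move with the most visits summed over all."""
    combined = Counter()
    for _ in range(num_dets):
        hidden = sample_hidden()  # one guess at the hidden information
        combined += ucb1_root_search(
            moves, lambda m: reward(m, hidden), iters_per_det
        )
    best = max(moves, key=lambda m: combined[m])
    return best, combined


# Hypothetical toy game: we hold HAND, the opponent holds one unseen card,
# and a move wins (reward 1) if our card outranks the opponent's card.
HAND = [3, 7, 9]
UNSEEN = [card for card in range(1, 11) if card not in HAND]


def toy_reward(card, opponent_card):
    return 1.0 if card > opponent_card else 0.0


if __name__ == "__main__":
    rng = random.Random(0)
    best, counts = determinized_search(
        HAND, lambda: rng.choice(UNSEEN), toy_reward,
        num_dets=40, iters_per_det=100,
    )
    print("chosen card:", best, "visit counts:", dict(counts))
```

Under a fixed budget, `num_dets * iters_per_det` stays constant, so raising one parameter lowers the other; the abstract's claim is that, for Dou Di Zhu, performance is not very sensitive to this split once both are large enough.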


Related papers

Learning to Play Hearthstone Using Machine Learning

The subject of this thesis is a new game called Hearthstone. It is a strategy card game developed by Blizzard Entertainment, in which players duel each other using cards they have collected. The game of Hearthstone provides a challenge for developing an artificial intelligence (AI) agent. The agent has to be able to deal with unknown information and stochastic events in a large search space. In ...

Monte-Carlo Tree Search for the Game of "7 Wonders"

The Monte-Carlo Tree Search algorithm, in particular when combined with the Upper Confidence Bounds formula, has provided huge improvements for AI in numerous games, particularly in Go, Hex, Havannah, Amazons and Breakthrough. In this work we study this algorithm on a more complex game, the game of “7 Wonders”. This card game gathers together several known difficult features, such as hidden information, N-player...

On the Chances of Completing the Game of "Perpetual Motion"

This brief paper describes the single-player card game called “Perpetual Motion” and reports on a computational analysis of the game’s outcome. The analysis follows a Monte Carlo methodology based on a sample of 10,000 randomly generated games. The key result is that 54.55% ± 0.89% of games can be completed (by a patient player!) but that the remaining 45.45% result in non-terminating cycles. T...

Intelligent System for Playing Tarok

We present an advanced intelligent system for playing the three-player card game tarok. The system is based on alpha-beta search with several enhancements, such as a fuzzy transposition table, which clusters strategically similar positions into generalised game states. The unknown distribution of other players' cards is addressed by Monte Carlo sampling. Experimental results show an additional reduction i...

Upper Confidence Trees with Short Term Partial Information

We show some mathematical links between partially observable (PO) games in which information is regularly revealed, and simultaneous-action games. Using this, we study the extension of Monte-Carlo Tree Search algorithms to PO games and to games with simultaneous actions. We apply the results to Urban Rivals, a free PO internet card game with more than 10 million registered users.



Publication date: 2011